Patent Application US/07/978,891

SEQUENCE LISTING

1) GENERAL INFORMATION 

i) APPLICANT: Darrell Anderson, Nabil Hanna, John Leonard, Roland Newman and Mitchell R 

ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF CHIMERIC ANTIBODY TO HUMAN B LYMPHOCY 

iii) NUMBER OF SEQUENCES: 8 

iv) CORRESPONDING ADDRESS: 

A) ADDRESSEE: IDEC Pharmaceuticals Corporation 

B) STREET: 11099 N. Torrey Pines Road, #160 

C) CITY: La Jolla 

D) STATE: California

E) COUNTRY: USA

F) ZIP: 92037

v) COMPUTER READABLE FORM: 

A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

B) COMPUTER: Macintosh 

C) OPERATING SYSTEM: MS. DOS 

D) SOFTWARE: Microsoft Word 5.0 

vi) CURRENT APPLICATION DATA:

A) APPLICATION NUMBER: 

B) FILING DATE: 

C) CLASSIFICATION:

viii) ATTORNEY/AGENT INFORMATION:

A) NAME: Burgoon, Richard P. Jr.

B) REGISTRATION NUMBER: 34,787 

C) REFERENCE/DOCKET NUMBER: 

ix) TELECOMMUNICATION INFORMATION: 



A) TELEPHONE: (619) 458-0600

B) TELEFAX: (619) 546-9274




2) INFORMATION FOR SEQ. ID. NO.: 1:

i) SEQUENCE CHARACTERISTICS:

A) LENGTH: 8540 bases

B) TYPE: nucleic acid

C) STRANDEDNESS: single
PATENT APPLICATION US/07/978,891 TIME: 09:46:26
LINE  ORIGINAL TEXT                                  CORRECTED TEXT

      (1) GENERAL INFORMATION                        (1) GENERAL INFORMATION:
12    (iv) CORRESPONDING ADDRESS:                    (iv) CORRESPONDENCE ADDRESS:
46    (2) INFORMATION FOR SEQ. ID. NO. : 1           (2) INFORMATION FOR SEQ ID NO: 1:
61    (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
351   (3) INFORMATION FOR SEQ. ID. NO. : 2           (2) INFORMATION FOR SEQ ID NO: 2:
366   (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
677   (4) INFORMATION FOR SEQ. ID. NO. : 3           (2) INFORMATION FOR SEQ ID NO: 3:
692   (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
698   (5) INFORMATION FOR SEQ. ID. NO. : 4           (2) INFORMATION FOR SEQ ID NO: 4:
713   (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
719   (6) INFORMATION FOR SEQ. ID. NO. : 5           (2) INFORMATION FOR SEQ ID NO: 5:
734   (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
754   (7) INFORMATION FOR SEQ. ID. NO. : 6           (2) INFORMATION FOR SEQ ID NO: 6:
769   (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
777   (8) INFORMATION FOR SEQ. ID. NO. : 7           (2) INFORMATION FOR SEQ ID NO: 7:
792   (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
797   (9) INFORMATION FOR SEQ. ID. NO. : 8:          (2) INFORMATION FOR SEQ ID NO: 8:
812   (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :      (ix) SEQUENCE DESCRIPTION: SEQ ID. NO:
MANDATORY IDENTIFIER THAT WAS NOT FOUND



CURRENT APPLICATION DATA 

APPLICATION NUMBER 

FILING DATE 

CLASSIFICATION 

PRIOR APPLICATION DATA 

APPLICATION NUMBER 

FILING DATE 



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 12/16/92 

PATENT APPLICATION US/07/978,891 TIME: 09:46:26 

S4267 

LINE ERROR ORIGINAL TEXT 



28 Response Exceeds Line Limitations 

30 Unknown or Misplaced Identifier 

31 Unknown or Misplaced Identifier 

32 Unknown or Misplaced Identifier 
695 Wrong Nucleic Acid Designator 

692 Entered and Calc. Seq. Length differ 

716 Wrong Nucleic Acid Designator 

734 Entered and Calc. Seq. Length differ 

772 Wrong Nucleic Acid Designator 

794 Wrong Nucleic Acid Designator 

792 Entered and Calc. Seq. Length differ 

812 Entered and Calc. Seq. Length differ 



(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

TC 3' 2 

(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :
5' TGC AGC ATC CGT ACG TTT GAT TTC CAG C
(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :
5' GCG GCT CCC ACG CGT GTC CTG TCC CAG 3
5' GG(G/C) TGT TGT GCT AGC TG(A/C) (A/G)
(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.:
(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO. :
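
The two sequence-level errors reported above ("Wrong Nucleic Acid Designator" and "Entered and Calc. Seq. Length differ") can be illustrated with a minimal Python sketch. It is not the verification software that produced this report. The function name `check_sequence`, the use of the IUPAC nucleotide codes as the accepted designator set, and the stripping of digits and prime marks are all assumptions made for illustration.

```python
import re

# Assumption: the accepted designators are the IUPAC nucleotide codes.
VALID_BASES = set("ACGTUMRWSYKVHDBN")


def check_sequence(declared_length, text):
    """Return report-style error strings for one sequence entry (hypothetical helper)."""
    # Drop whitespace, digits (position counters, 5/3 end labels) and prime marks.
    residues = re.sub(r"[\s\d']", "", text).upper()
    errors = []
    # Any remaining character outside the accepted set is a wrong designator.
    bad = sorted(set(residues) - VALID_BASES)
    if bad:
        errors.append("Wrong Nucleic Acid Designator: " + "".join(bad))
    # Compare the declared LENGTH field with the number of residues counted.
    if len(residues) != declared_length:
        errors.append(
            "Entered and Calc. Seq. Length differ: "
            f"{declared_length} vs {len(residues)}"
        )
    return errors
```

Under these assumptions, an entry whose LENGTH field matches its counted residues and uses only accepted designators yields an empty list, while a stray letter such as `Q` or a mismatched LENGTH yields one message each.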
53
54 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
67 
68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 
92 
93 
94 
95 
96 
97 
98 
99 
100 
101 
102 
103 
104 




(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 1:

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 

CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 1020 

CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 



Page: 3 



Raw Sequence Listing 



Patent Application US/07/978,891 



12/16/92 
09:44:59 
S4267.raw 



105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 
152 
153 
154 
155 
156 



AGCCTCAGCA 
TGCGAAGTCA 
TGTTGAATTC 
GGCCGTGATA 
GTTTGCCCCT 
TAATAAAATG 
GGGGTGGGGC 
GCGGTGGGCT 
ACGTCAATGA 
TTCCTACTTG 
GGCAGTACAT 
CCATTGACGT 
GTAACAACTC 
TAAGCAGAGC 
ATGGGTTGGA 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
TTCCTCTTCC 
TGCGTGGTGG 
GGCGTGGAGG 
CGTGTGGTCA 
TGCAAGGTCT 



GCACCCTGAC 
CCCATCAGGG 
AGATCCGTTA 
TCTACGTATG 
CCCCCGTGCC 
AGGAAATTGC 
AGGACAGCAA 
CTATGGAACC 
CGGTAAATGG 
GCAGTACATC 
CAATGGGCGT 
CAATGGGAGT 
CGCCCCATTG 
TGGGTACGTC 
GCCTCATCTT 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC 
CCCCAAAACC 
TGGACGTGAG 
TGCATAATGC 
GCGTCCTCAC 
CCAACAAAGC 



GCTGAGCAAA 
CCTGAGCTCG 
ACGGTTACCA 
ATCAGCCTCG 
TTCCTTGACC 
ATCGCATTGT 
GGGGGAGGAT 
AGCTGGGGCT 
CCCGCCTGGC 
TACGTATTAG 
GGATAGCGGT 
TTGTTTTGGC 
ACGCAAATGG 
CTCACATTCA 
GCTCTTCCTT 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
CAAGGACACC 
CCACGAAGAC 
CAAGACAAAG 
CGTCCTGCAC 
CCTCCCAGCC 



GCAGACTACG 
CCCGTCACAA 
ACTACCTAGA 
ACTGTGCCTT 
CTGGAAGGTG 
CTGAGTAGGT 
TGGGAAGACA 
CGACAGCTAT 
ATTATGCCCA 
TCATCGCTAT 
TTGACTCACG 
ACCAAAATCA 
GCGGTAGGCG 
GTGATCAGCA 
GTCGCTGTTG 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
GCACCTGAAC 
CTCATGATCT 
CCTGAGGTCA 
CCGCGGGAGG 
CAGGACTGGC 
CCCATCGAGA 



AGAAACACAA 
AGAGCTTCAA 
CTGGATTCGT 
CTAGTTGCCA 
CCACTCCCAC 
GTCATTCTAT 
ATAGCAGGCA 
GCCAAGTACG 
GTACATGACC 
TACCATGGTG 
GGGATTTCCA 
ACGGGACTTT 
TGTACGGTGG 
CTGAACACAG 
CTACGCGTGT 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGCAGAGCC 
TCCTGGGGGG 
CCCGGACCCC 
AGTTCAACTG 
AGCAGTACAA 
TGAATGGCAA 
AAACCATCTC 



AGTCTACGCC 
CAGGGGAGAG 
GACAACATGC 
GCCATCTGTT 
TGTCCTTTCC 
TCTGGGGGGT 
TGCTGGGGAT 
CCCCCTATTG 
TTATGGGACT 
ATGCGGTTTT 
AGTCTCCACC 
CCAAAATGTC 
GAGGTCTATA 
ACCCGTCGAC 
CGCTAGCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
ACCGTCAGTC 
TGAGGTCACA 
GTACGTGGAC 
CAGCACGTAC 
GGAGTACAAG 
CAAAGCCAAA 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
157
158 
159 
160 
161 
162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 
194 
195 
196 
197 
198 
199 
200 
201 
202 
203 
204 
205 
206 
207 
208 



GGGCAGCCCC 
AACCAGGTCA 
TGGGAGAGCA 
GACGGCTCCT 
AACGTCTTCT 
CTCTCCCTGT 
TCGTGACAAC 
GCCAGCCATC 
CCACTGTCCT 
CTATTCTGGG 
GGCATGCTGG 
CCCGATCCCC 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAAG 
GCCTGATTCC 
GCAGGAGCCA 
CTGACATAGT 
CAAACTTGAC 
GTTCGACCAT 
GACCTACCCT 
TCTTCAGTGG 
CCTGAGAAGA 
GAACCACCAC 
GAACAACCGG 
TACCAGGAAG 



GAGAACCACA 
GCCTGACCTG 
ATGGGCAGCC 
TCTTCCTCTA 
CATGCTCCGT 
CTCCGGGTAA 
ATGCGGCCGT 
TGTTGTTTGC 
TTCCTAATAA 
GGGTGGGGTG 
GGATGCGGTG 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCAGTGAGTG 
GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
GGCAATCCTA 
TGAACTGCAT 
GGCCTCCGCT 
AAGGTAAACA 
ATCGACCTTT 
GAGGAGCTCA 
AATTGGCAAG 
CCATGAATCA 



GGTGTACACC 
CCTGGTCAAA 
GGAGAACAAC 
CAGCAAGCTC 
GATGCATGAG 
ATGAGGATCC 
GATATCTACG 
CCCTCCCCCG 
AATGAGGAAA 
GGGCAGGACA 
GGCTCTATGG 
CTCAATTTCT 
AGTTGATTGA 
ATAAGGACAA 
GCACAGCATT 
ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATAG 
GCGTGAAGGC 
CGTCGCCGTG 
CAGGAACGAG 
GAATCTGGTG 
AAAGGACAGA 
TTTTCTTGCC 
TAAAGTAGAC 
ACCAGGCCAC 



CTGCCCCCAT 
GGCTTCTATC 
TACAAGACCA 
ACCGTGGACA 
GCTCTGCACA 
GTTAACGGTT 
TATGATCAGC 
TGCCTTCCTT 
TTGCATCGCA 
GCAAGGGGGA 
AACCAGCTGG 
TATTTGCATA 
GCAAATGCGT 
ACATTATTCA 
CTAGGGAGAA 
GGGCCAATCT 
GGTAGGATCA 
CTTGGACAGC 
TGGTAGGATT 
TCCCAAAATA 
TTCAAGTACT 
ATTATGGGTA 
ATTAATATAG 
AAAAGTTTGG 
ATGGTTTGGA 
CTTAGACTCT 



CCCGGGATGA 
CCAGCGACAT 
CGCCTCCCGT 
AGAGCAGGTG 
ACCACTACAC 
ACCAACTACC 
CTCGACTGTG 
GACCCTGGAA 
TTGTCTGAGT 
GGATTGGGAA 
GGCTCGACAG 
ATGAGAAAAA 
TGCCAAAAAG 
GAGGGAGTAC 
ATATGCTTGT 
GCTCACACAG 
GTTGCTCCTC 
TCAGGGCTGC 
TTATCCCCGC 
TGGGGATTGG 
TCCAAAGAAT 
GGAAAACCTG 
TTCTCAGTAG 
ATGATGCCTT 
TAGTCGGAGG 
TTGTGACAAG 



GCTGACCAAG 
CGCCGTGGAG 
GCTGGACTCC 
GCAGCAGGGG 
GCAGAAGAGC 
TAGACTGGAT 
CCTTCTAGTT 
GGTGCCACTC 
AGGTGTCATT 
GACAATAGCA 
CGCTGGATCT 
AAGGAAAATT 
GATGCTTTAG 
CCAGAGCTGA 
CATCACCGAA 
GATAGAGAGG 
ACATTTGCTT 
GATTTCGCGC 
TGCCATCATG 
CAAGAACGGA 
GACCACAACC 
GTTCTCCATT 
AGAACTCAAA 
AAGACTTATT 
CAGTTCTGTT 
GATCATGCAG 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
209
210 
211 
212 
213 
214 
215 
216 
217 
218 
219 
220 
221 
222 
223 
224 
225 
226 
227 
228 
229 
230 
231 
232 
233 
234 
235 
236 
237 
238 
239 
240 
241 
242 
243 
244 
245 
246 
247 
248 
249 
250 
251 
252 
253 
254 
255 
256 
257 
258 
259 
260 



GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 4440 

GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 4500 

GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560 

CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4620 

TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 4680 

TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 4740 

GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 

CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4860 

TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4920 

ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4980 

TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 

GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 

CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 

AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 5220 

TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 

TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG 5340 

CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 5400 

CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 5460 

CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 5520 

GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 5580 

GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 5640 

CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG 5700 

TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 5760 

CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 5820 

CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 

GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 5940 
261
262 
263 
264 
265 
266 
267 
268 
269 
270 
271 
272 
273 
274 
275 
276 
277 
278 
279 
280 
281 
282 
283 
284 
285 
286 
287 
288 
289 
290 
291 
292 
293 
294 
295 
296 
297 
298 
299 
300 
301 
302 
303 
304 
305 
306 
307 
308 
309 
310 
311 
312 



GCTTGGCGGC 
GCAGCGCATC 
GAAATGACCG 
TTCTATGAAA 
CGCGGGGATC 
GGTTACAAAT 
TCTAGTTGTG 
ATCCCGTCGA 
CGCTCACAAT 
AATGAGTGAG 
ACCTGTCGTG 
TTGGGCGCTC 
GAGCGGTATC 
CAGGAAAGAA 
TGCTGGCGTT 
GTCAGAGGTG 
CCCTCGTGCG 
CTTCGGGAAG 
TCGTTCGCTC 
TATCCGGTAA 
CAGCCACTGG 
AGTGGTGGCC 
AGCCAGTTAC 
GTAGCGGTGG 
AAGATCCTTT 
GGATTTTGGT 



GAATGGGCTG 
GCCTTCTATC 
ACCAAGCGAC 
GGTTGGGCTT 
TCATGCTGGA 
AAAGCAATAG 
GTTTGTCCAA 
GAGCTTGGCG 
TCCACACAAC 
CTAACTCACA 
CCAGCTGCAT 
TTCCGCTTCC 
AGCTCACTCA 
CATGTGAGCA 
TTTCCATAGG 
GCGAAACCCG 
CTCTCCTGTT 
CGTGGCGCTT 
CAAGCTGGGC 
CTATCGTCTT 
TAACAGGATT 
TAACTACGGC 
CTTCGGAAAA 
TTTTTTTGTT 
GATCTTTTCT 
CATGAGATTA 



ACCGCTTCCT 
GCCTTCTTGA 
GCCCAACCTG 
CGGAATCGTT 
GTTCTTCGCC 
CATCACAAAT 
ACTCATCAAT 
TAATCATGGT 
ATACGAGCCG 
TTAATTGCGT 
TAATGAATCG 
TCGCTCACTG 
AAGGCGGTAA 
AAAGGCCAGC 
CTCCGCCCCC 
ACAGGACTAT 
CCGACCCTGC 
TCTCAATGCT 
TGTGTGCACG 
GAGTCCAACC 
AGCAGAGCGA 
TACACTAGAA 
AGAGTTGGTA 
TGCAAGCAGC 
ACGGGGTCTG 
TCAAAAAGGA 



CGTGCTTTAC 
CGAGTTCTTC 
CCATCACGAG 
TTCCGGGACG 
CACCCCAACT 
TTCACAAATA 
CTATCTTATC 
CATAGCTGTT 
GAAGCATAAA 
TGCGCTCACT 
GCCAACGCGC 
ACTCGCTGCG 
TACGGTTATC 
AAAAGGCCAG 
CTGACGAGCA 
AAAGATACCA 
CGCTTACCGG 
CACGCTGTAG 
AACCCCCCGT 
CGGTAAGACA 
GGTATGTAGG 
GGACAGTATT 
GCTCTTGATC 
AGATTACGCG 
ACGCTCAGTG 
TCTTCACCTA 



GGTATCGCCG 
TGAGCGGGAC 
ATTTCGATTC 
CCGGCTGGAT 
TGTTTATTGC 
AAGCATTTTT 
ATGTCTGGAT 
TCCTGTGTGA 
GTGTAAAGCC 
GCCCGCTTTC 
GGGGAGAGGC 
CTCGGTCGTT 
CACAGAATCA 
GAACCGTAAA 
TCACAAAAAT 
GGCGTTTCCC 
ATACCTGTCC 
GTATCTCAGT 
TCAGCCCGAC 
CGACTTATCG 
CGGTGCTACA 
TGGTATCTGC 
CGGCAAACAA 
CAGAAAAAAA 
GAACGAAAAC 
GATCCTTTTA 



CTCCCGATTC 
TCTGGGGTTC 
CACCGCCGCC 
GATCCTCCAG 
AGCTTATAAT 
TTCACTGCAT 
CGCGGCCGCG 
AATTGTTATC 
TGGGGTGCCT 
CAGTCGGGAA 
GGTTTGCGTA 
CGGCTGCGGC 
GGGGATAACG 
AAGGCCGCGT 
CGACGCTCAA 
CCTGGAAGCT 
GCCTTTCTCC 
TCGGTGTAGG 
CGCTGCGCCT 
CCACTGGCAG 
GAGTTCTTGA 
GCTCTGCTGA 
ACCACCGCTG 
GGATCTCAAG 
TCACGTTAAG 
AATTAAAAAT 



6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
313
314 
315 
316 
317 
318 
319 
320 
321 
322 
323 
324 
325 
326 
327 
328 
329 
330 
331 
332 
333 
334 
335 
336 
337 
338 
339 
340 
341 
342 
343 
344 
345 
346 
347 
348 
349 
350 
351 
352 
353 
354 
355 
356 
357 
358 
359 
360 
361 
362 
363 
364 



GAAGTTTTAA 
TAATCAGTGA 
TCCCCGTCGT 
TGATACCGCG 
GAAGGGCCGA 
GTTGCCGGGA 
TTGCTACAGG 
CCCAACGATC 
TCGGTCCTCC 
CAGCACTGCA 
AGTACTCAAC 
CGTCAATACG 
AACGTTCTTC 
AACCCACTCG 
GAGCAAAAAC 
GAATACTCAT 
TGAGCGGATA 
TTCCCCGAAA 



ATCAATCTAA 
GGCACCTATC 
GTAGATAACT 
AGACCCACGC 
GCGCAGAAGT 
AGCTAGAGTA 
CATCGTGGTG 
AAGGCGAGTT 
GATCGTTGTC 
TAATTCTCTT 
CAAGTCATTC 
GGATAATACC 
GGGGCGAAAA 
TGCACCCAAC 
AGGAAGGCAA 
ACTCTTCCTT 
CATATTTGAA 
AGTGCCACCT 



AGTATATATG 
TCAGCGATCT 
ACGATACGGG 
TCACCGGCTC 
GGTCCTGCAA 
AGTAGTTCGC 
TCACGCTCGT 
ACATGATCCC 
AGAAGTAAGT 
ACTGTCATGC 
TGAGAATAGT 
GCGCCACATA 
CTCTCAAGGA 
TGATCTTCAG 
AATGCCGCAA 
TTTCAATATT 
TGTATTTAGA 
8540 



AGTAAACTTG 
GTCTATTTCG 
AGGGCTTACC 
CAGATTTATC 
CTTTATCCGC 
CAGTTAATAG 
CGTTTGGTAT 
CCATGTTGTG 
TGGCCGCAGT 
CATCCGTAAG 
GTATGCGGCG 
GCAGAACTTT 
TCTTACCGCT 
CATCTTTTAC 
AAAAGGGAAT 
ATTGAAGCAT 
AAAATAAACA 



GTCTGACAGT 
TTCATCCATA 
ATCTGGCCCC 
AGCAATAAAC 
CTCCATCCAG 
TTTGCGCAAC 
GGCTTCATTC 
CAAAAAAGCG 
GTTATCACTC 
ATGCTTTTCT 
ACCGAGTTGC 
AAAAGTGCTC 
GTTGAGATCC 
TTTCACCAGC 
AAGGGCGACA 
TTATCAGGGT 
AATAGGGGTT 



TACCAATGCT 
GTTGCCTGAC 
AGTGCTGCAA 
CAGCCAGCCG 
TCTATTAATT 
GTTGTTGCCA 
AGCTCCGGTT 
GTTAGCTCCT 
ATGGTTATGG 
GTGACTGGTG 
TCTTGCCCGG 
ATCATTGGAA 
AGTTCGATGT 
GTTTCTGGGT 
CGGAAATGTT 
TATTGTCTCA 
CCGCGCACAT 



(3) INFORMATION FOR SEQ. ID. NO.: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9209 bases

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 



7560 
7620 
7680 
7740 
7800 
7860 
7920 
8040
8220
8280 
8340 
365
366 
367 
368 
369 
370 
371 
372 
373 
374 
375 
376 
377 
378 
379 
380 
381 
382 
383 
384 
385 
386 
387 
388 
389 
390 
391 
392 
393 
394 
395 
396 
397 
398 
399 
400 
401 
402 
403 
404 
405 
406 
407 
408 
409 
410 
411 
412 
413 
414 
415 
416 



(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 2:



GACGTCGCGG 
AGGCCGAGGC 
GGAGAATGGG 
ATGGTTGCTG 
GACTTTCCAC 
GGGGAGCCTG 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGACT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
CATCACAGAT 
GCTTCAGTCA 
GCATCTCCAG 
CACTGGTTCC 
CTGGCTTCTG 
ACAATCAGCA 
AACCCACCCA 
TCTGTCTTCA 



CCGCTCTAGG 
GGCCTCGGCC 
CGGAACTGGG 
ACTAATTGAG 
ACCTGGTTGC 
GGGACTTTCC 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
CTCTCACTAT 
TAATGTCCAG 
GGGAGAAGGT 
AGCAGAAGCC 
GAGTCCCTGT 
GAGTGGAGGC 
CGTTCGGAGG 
TCTTCCCGCC 



CCTCCAAAAA 
TCTGCATAAA 
CGGAGTTAGG 
ATGCATGCTT 
TGACTAATTG 
ACACCCTAAC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTGGG 
GGATTTTCAG 
AGGACAAATT 
CACAATGACT 
AGGATCCTCC 
TCGCTTCAGT 
TGAAGATGCT 
GGGGACCAAG 
ATCTGATGAG 



AGCCTCCTCA 
TAAAAAAAAT 
GGCGGGATGG 
TGCATACTTC 
AGATGCATGC 
TGACACACAT 
TTAGTTCATA 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TACGTGAACC 
GTGCAGATTA 
GTTCTCTCCC 
TGCAGGGCCA 
CCCAAACCCT 
GGCAGTGGGT 
GCCACTTATT 
CTGGAAATCA 
CAGTTGAAAT 



CTACTTCTGG 
TAGTCAGCCA 
GCGGAGTTAG 
TGCCTGCTGG 
TTTGCATACT 
TCCACAGAAT 
GCCCATATAT 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
AAATGGGCGG 
GTCAGATCGC 
TCAGCTTCCT 
AGTCTCCAGC 
GCTCAAGTGT 
GGATTTATGC 
CTGGGACTTC 
ACTGCCAGCA 
AACGTACGGT 
CTGGAACTGC 



AATAGCTCAG 
TGCATGGGGC 
GGGCGGGACT 
GGAGCCTGGG 
TCTGCCTGCT 
TAATTCCCCT 
GGAGTTCCGC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
CTGGAGACGC 
GCTAATCAGT 
AATCCTGTCT 
AAGTTACATC 
CACATCCAAC 
TTACTCTCTC 
GTGGACTAGT 
GGCTGCACCA 
CTCTGTTGTG 



60 
120 
180 
240 
300 
360 
420 
480 
660
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
417
418 
419 
420 
421 
422 
423 
424 
425 
426 
427 
428 
429 
430 
431 
432 
433 
434 
435 
436 
437 
438 
439 
440 
441 
442 
443 
444 
445 
446 
447 
448 
449 
450 
451 
452 
453 
454 
455 
456 
457 
458 
459 
460 
461 
462 
463 
464 
465 
466 
467 
468 



TGCCTGCTGA 
CTCCAATCGG 
AGCCTCAGCA 
TGCGAAGTCA 
TGTTGAATTC 
GGCCGTGATA 
GTTTGCCCCT 
TAATAAAATG 
GGGGTGGGGC 
GCGGTGGGCT 
ACGTCAATGA 
TTCCTACTTG 
GGCAGTACAT 
CCATTGACGT 
GTAACAACTC 
TAAGCAGAGC 
ATGGGTTGGA 
GTACAACTGC 
TGCAAGGCTT 
GGTCGGGGCC 
CAGAAGTTCA 
CAGCTCAGCA 
TACGGCGGTG 
GCTAGCACCA 
GGCACAGCGG 
TGGAACTCAG 



ATAACTTCTA 
GTAACTCCCA 
GCACCCTGAC 
CCCATCAGGG 
AGATCCGTTA 
TCTACGTATG 
CCCCCGTGCC 
AGGAAATTGC 
AGGACAGCAA 
CTATGGAACC 
CGGTAAATGG 
GCAGTACATC 
CAATGGGCGT 
CAATGGGAGT 
CGCCCCATTG 
TGGGTACGTC 
GCCTCATCTT 
AGCAGCCTGG 
CTGGCTACAC 
TGGAATGGAT 
AAGGCAAGGC 
GCCTGACATC 
ACTGGTACTT 
AGGGCCCATC 
CCCTGGGCTG 
GCGCCCTGAC 



TCCCAGAGAG 
GGAGAGTGTC 
GCTGAGCAAA 
CCTGAGCTCG 
ACGGTTACCA 
ATCAGCCTCG 
TTCCTTGACC 
ATCGCATTGT 
GGGGGAGGAT 
AGCTGGGGCT 
CCCGCCTGGC 
TACGTATTAG 
GGATAGCGGT 
TTGTTTTGGC 
ACGCAAATGG 
CTCACATTCA 
GCTCTTCCTT 
GGCTGAGCTG 
ATTTACCAGT 
TGGAGCTATT 
CACATTGACT 
TGAGGACTCT 
CAATGTCTGG 
GGTCTTCCCC 
CCTGGTCAAG 
CAGCGGCGTG 



GCCAAAGTAC 
ACAGAGCAGG 
GCAGACTACG 
CCCGTCACAA 
ACTACCTAGA 
ACTGTGCCTT 
CTGGAAGGTG 
CTGAGTAGGT 
TGGGAAGACA 
CGACAGCTAT 
ATTATGCCCA 
TCATCGCTAT 
TTGACTCACG 
ACCAAAATCA 
GCGGTAGGCG 
GTGATCAGCA 
GTCGCTGTTG 
GTGAAGCCTG 
TACAATATGC 
TATCCCGGAA 
GCAGACAAAT 
GCGGTCTATT 
GGCGCAGGGA 
CTGGCACCCT 
GACTACTTCC 
CACACCTTCC 



AGTGGAAGGT 
ACAGCAAGGA 
AGAAACACAA 
AGAGCTTCAA 
CTGGATTCGT 
CTAGTTGCCA 
CCACTCCCAC 
GTCATTCTAT 
ATAGCAGGCA 
GCCAAGTACG 
GTACATGACC 
TACCATGGTG 
GGGATTTCCA 
ACGGGACTTT 
TGTACGGTGG 
CTGAACACAG 
CTACGCGTGT 
GGGCCTCAGT 
ACTGGGTAAA 
ATGGTGATAC 
CCTCCAGCAC 
ACTGTGCAAG 
CCACGGTCAC 
CCTCCAAGAG 
CCGAACCGGT 
CGGCTGTCCT 



GGATAACGCC 
CAGCACCTAC 
AGTCTACGCC 
CAGGGGAGAG 
GACAACATGC 
GCCATCTGTT 
TGTCCTTTCC 
TCTGGGGGGT 
TGCTGGGGAT 
CCCCCTATTG 
TTATGGGACT 
ATGCGGTTTT 
AGTCTCCACC 
CCAAAATGTC 
GAGGTCTATA 
ACCCGTCGAC 
CCTGTCCCAG 
GAAGATGTCC 
ACAGACACCT 
TTCCTACAAT 
AGCCTACATG 
ATCGACTTAC 
CGTCTCTGCA 
CACCTCTGGG 
GACGGTGTCG 
ACAGTCCTCA 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
469
470 
471 
472 
473 
474 
475 
476 
477 
478 
479 
480 
481 
482 
483 
484 
485 
486 
487 
488 
489 
490 
491 
492 
493 
494 
495 
496 
497 
498 
499 
500 
501 
502 
503 
504 
505 
506 
507 
508 
509 
510 
511 
512 
513 
514 
515 
516 
517 
518 
519 
520 



GGACTCTACT 
TACATCTGCA 
AAATCTTGTG 
CCGTCAGTCT 
GAGGTCACAT 
TACGTGGACG 
AGCACGTACC 
GAGTACAAGT 
AAAGCCAAAG 
CTGACCAAGA 
GCCGTGGAGT 
CTGGACTCCG 
CAGCAGGGGA 
CAGAAGAGCC 
AGACTGGATT 
CTTCTAGTTG 
GTGCCACTCC 
GGTGTCATTC 
ACAATAGCAG 
GCTGGATCTC 
AGGAAAATTA 
ATGCTTTAGA 
CAGAGCTGAG 
ATCACCGAAG 
ATAGAGAGGG 
CATTTGCTTC 



CCCTCAGCAG 
ACGTGAATCA 
ACAAAACTCA 
TCCTCTTCCC 
GCGTGGTGGT 
GCGTGGAGGT 
GTGTGGTCAG 
GCAAGGTCTC 
GGCAGCCCCG 
ACCAGGTCAG 
GGGAGAGCAA 
ACGGCTCCTT 
ACGTCTTCTC 
TCTCCCTGTC 
CGTGACAACA 
CCAGCCATCT 
CACTGTCCTT 
TATTCTGGGG 
GCATGCTGGG 
CCGATCCCCA 
ATTTTAACAC 
GACAGTGTTC 
ACTCCTAAGC 
CCTGATTCCG 
CAGGAGCCAG 
TGACATAGTT 



CGTGGTGACC 
CAAGCCCAGC 
CACATGCCCA 
CCCAAAACCC 
GGACGTGAGC 
GCATAATGCC 
CGTCCTCACC 
CAACAAAGCC 
AGAACCACAG 
CCTGACCTGC 
TGGGCAGCCG 
CTTCCTCTAC 
ATGCTCCGTG 
TCCGGGTAAA 
TGCGGCCGTG 
GTTGTTTGCC 
TCCTAATAAA 
GGTGGGGTGG 
GATGCGGTGG 
GCTTTGCTTC 
CAATTCAGTA 
TCTGCACAGA 
CAGTGAGTGG 
TAGAGCCACA 
GGCAGAGCAT 
GTGTTGGGAG 



GTGCCCTCCA 
AACACCAAGG 
CCGTGCCCAG 
AAGGACACCC 
CACGAAGACC 
AAGACAAAGC 
GTCCTGCACC 
CTCCCAGCCC 
GTGTACACCC 
CTGGTCAAAG 
GAGAACAACT 
AGCAAGCTCA 
ATGCATGAGG 
TGAGGATCCG 
ATATCTACGT 
CCTCCCCCGT 
ATGAGGAAAT 
GGCAGGACAG 
GCTCTATGGA 
TCAATTTCTT 
GTTGATTGAG 
TAAGGACAAA 
CACAGCATTC 
CCTTGGTAAG 
ATAAGGTGAG 
CTTGGATAGC 



GCAGCTTGGG 
TGGACAAGAA 
CACCTGAACT 
TCATGATCTC 
CTGAGGTCAA 
CGCGGGAGGA 
AGGACTGGCT 
CCATCGAGAA 
TGCCCCCATC 
GCTTCTATCC 
ACAAGACCAC 
CCGTGGACAA 
CTCTGCACAA 
TTAACGGTTA 
ATGATCAGCC 
GCCTTCCTTG 
TGCATCGCAT 
CAAGGGGGAG 
ACCAGCTGGG 
ATTTGCATAA 
CAAATGCGTT 
CATTATTCAG 
TAGGGAGAAA 
GGCCAATCTG 
GTAGGATCAG 
TTGGACAGCT 



CACCCAGACC 
AGCAGAGCCC 
CCTGGGGGGA 
CCGGACCCCT 
GTTCAACTGG 
GCAGTACAAC 
GAATGGCAAG 
AACCATCTCC 
CCGGGATGAG 
CAGCGACATC 
GCCTCCCGTG 
GAGCAGGTGG 
CCACTACACG 
CCAACTACCT 
TCGACTGTGC 
ACCCTGGAAG 
TGTCTGAGTA 
GATTGGGAAG 
GCTCGACAGC 
TGAGAAAAAA 
GCCAAAAAGG 
AGGGAGTACC 
TATGCTTGTC 
CTCACACAGG 
TTGCTCCTCA 
CAGGGCTGCG 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
521
522 
523 
524 
525 
526 
527 
528 
529 
530 
531 
532 
533 
534 
535 
536 
537 
538 
539 
540 
541 
542 
543 
544 
545 
546 
547 
548 
549 
550 
551 
552 
553 
554 
555 
556 
557 
558 
559 
560 
561 
562 
563 
564 
565 
566 
567 
568 
569 
570 
571 
572 



ATTTCGCGCC 
GCCATCATGG 
AAGAACGGAG 
ACCACAACCT 
TTCTCCATTC 
GAACTCAAAG 
AGACTTATTG 
AGTTCTGTTT 
ATCATGCAGG 
CTTCTCCCAG 
AAGTTTGAAG 
CTCCTAAAGC 
GACTGTGCCT 
CCTGGAAGGT 
TCTGAGTAGG 
TTGGGAAGAC 
TCGAGCTACT 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAAG 
GCCTGATTCC 
GCAGGAGCCA 
CTGACATAGT 
CGCAGGTTCT 
AATCGGCTGC 
TGTCAAGACC 



AAACTTGACG 
TTCGACCATT 
ACCTACCCTG 
CTTCAGTGGA 
CTGAGAAGAA 
AACCACCACG 
AACAACCGGA 
ACCAGGAAGC 
AATTTGAAAG 
AATACCCAGG 
TCTACGAGAA 
TATGCATTTT 
TCTAGTTGCC 
GCCACTCCCA 
TGTCATTCTA 
AATAGCAGGC 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCAGTGAGTG 
GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
CCGGCCGCTT 
TCTGATGCCG 
GACCTGTCCG 



GCAATCCTAG 
GAACTGCATC 
GCCTCCGCTC 
AGGTAAACAG 
TCGACCTTTA 
AGGAGCTCAT 
ATTGGCAAGT 
CATGAATCAA 
TGACACGTTT 
CGTCCTCTCT 
GAAAGACTAA 
TATAAGACCA 
AGCCATCTGT 
CTGTCCTTTC 
TTCTGGGGGG 
ATGCTGGGGA 
CTCAATTTCT 
AGTTGATTGA 
ATAAGGACAA 
GCACAGCATT 
ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATCG 
GGGTGGAGAG 
CCGTGTTCCG 
GTGCCCTGAA 



CGTGAAGGCT 
GTCGCCGTGT 
AGGAACGAGT 
AATCTGGTGA 
AAGGACAGAA 
TTTCTTGCCA 
AAAGTAGACA 
CCAGGCCACC 
TTCCCAGAAA 
GAGGTCCAGG 
CAGGAAGATG 
TGGGACTTTT 
TGTTTGCCCC 
CTAATAAAAT 
TGGGGTGGGG 
TGCGGTGGGC 
TATTTGCATA 
GCAAATGCGT 
ACATTATTCA 
CTAGGGAGAA 
GGGCCAATCT 
GGTAGGATCA 
ATCCTCTATG 
GCTATTCGGC 
GCTGTCAGCG 
TGAACTGCAG 



GGTAGGATTT 
CCCAAAATAT 
TCAAGTACTT 
TTATGGGTAG 
TTAATATAGT 
AAAGTTTGGA 
TGGTTTGGAT 
TTAGACTCTT 
TTGATTTGGG 
AGGAAAAAGG 
CTTTCAAGTT 
GCTGGCTTTA 
TCCCCCGTGC 
GAGGAAATTG 
CAGGACAGCA 
TCTATGGAAC 
ATGAGAAAAA 
TGCCAAAAAG 
GAGGGAGTAC 
ATATGCTTGT 
GCTCACACAG 
GTTGCTCCTC 
GTTGAACAAG 
TATGACTGGG 
CAGGGGCGCC 
GACGAGGCAG 



TATCCCCGCT 
GGGGATTGGC 
CCAAAGAATG 
GAAAACCTGG 
TCTCAGTAGA 
TGATGCCTTA 
AGTCGGAGGC 
TGTGACAAGG 
GAAATATAAA 
CATCAAGTAT 
CTCTGCTCCC 
GATCAGCCTC 
CTTCCTTGAC 
CATCGCATTG 
AGGGGGAGGA 
CAGCTGGGGC 
AAGGAAAATT 
GATGCTTTAG 
CCAGAGCTGA 
CATCACCGAA 
GATAGAGAGG 
ACATTTGCTT 
ATGGATTGCA 
CACAACAGAC 
CGGTTCTTTT 
CGCGGCTATC 



4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
573
574 
575 
576 
577 
578 
579 
580 
581 
582 
583 
584 
585 
586 
587 
588 
589 
590 
591 
592 
593 
594 
595 
596 
597 
598 
599 
600 
601 
602 
603 
604 
605 
606 
607 
608 
609 
610 
611 
612 
613 
614 
615 
616 
617 
618 
619 
620 
621 
622 
623 
624 



GTGGCTGGCC 
AAGGGACTGG 
TCCTGCCGAG 
GGCTACCTGC 
GGAAGCCGGT 
CGAACTGTTC 
TGGCGATGCC 
CTGTGGCCGG 
TGCTGAAGAG 
TCCCGATTCG 
CTGGGGTTCG 
ACCGCCGCCT 
ATCCTCCAGC 
GCTTATAATG 
TCACTGCATT 
GCGGCCGCGA 
ATTGTTATCC 
GGGGTGCCTA 
AGTCGGGAAA 
GTTTGCGTAT 
GGCTGCGGCG 
GGGATAACGC 
AGGCCGCGTT 
GACGCTCAAG 
CTGGAAGCTC 
CCTTTCTCCC 



ACGACGGGCG 
CTGCTATTGG 
AAAGTATCCA 
CCATTCGACC 
CTTGTCGATC 
GCCAGGCTCA 
TGCTTGCCGA 
CTGGGTGTGG 
CTTGGCGGCG 
CAGCGCATCG 
AAATGACCGA 
TCTATGAAAG 
GCGGGGATCT 
GTTACAAATA 
CTAGTTGTGG 
TCCCGTCGAG 
GCTCACAATT 
ATGAGTGAGC 
CCTGTCGTGC 
TGGGCGCTCT 
AGCGGTATCA 
AGGAAAGAAC 
GCTGGCGTTT 
TCAGAGGTGG 
CCTCGTGCGC 
TTCGGGAAGC 



TTCCTTGCGC 
GCGAAGTGCC 
TCATGGCTGA 
ACCAAGCGAA 
AGGATGATCT 
AGGCGCGCAT 
ATATCATGGT 
CGGACCGCTA 
AATGGGCTGA 
CCTTCTATCG 
CCAAGCGACG 
GTTGGGCTTC 
CATGCTGGAG 
AAGCAATAGC 
TTTGTCCAAA 
AGCTTGGCGT 
CCACACAACA 
TAACTCACAT 
CAGCTGCATT 
TCCGCTTCCT 
GCTCACTCAA 
ATGTGAGCAA 
TTCCATAGGC 
CGAAACCCGA 
TCTCCTGTTC 
GTGGCGCTTT 



AGCTGTGCTC 
GGGGCAGGAT 
TGCAATGCGG 
ACATCGCATC 
GGACGAAGAG 
GCCCGACGGC 
GGAAAATGGC 
TCAGGACATA 
CCGCTTCCTC 
CCTTCTTGAC 
CCCAACCTGC 
GGAATCGTTT 
TTCTTCGCCC 
ATCACAAATT 
CTCATCAATC 
AATCATGGTC 
TACGAGCCGG 
TAATTGCGTT 
AATGAATCGG 
CGCTCACTGA 
AGGCGGTAAT 
AAGGCCAGCA 
TCCGCCCCCC 
CAGGACTATA 
CGACCCTGCC 
CTCAATGCTC 



GACGTTGTCA 
CTCCTGTCAT 
CGGCTGCATA 
GAGCGAGCAC 
CATCAGGGGC 
GAGGATCTCG 
CGCTTTTCTG 
GCGTTGGCTA 
GTGCTTTACG 
GAGTTCTTCT 
CATCACGAGA 
TCCGGGACGC 
ACCCCAACTT 
TCACAAATAA 
TATCTTATCA 
ATAGCTGTTT 
AAGCATAAAG 
GCGCTCACTG 
CCAACGCGCG 
CTCGCTGCGC 
ACGGTTATCC 
AAAGGCCAGG 
TGACGAGCAT 
AAGATACCAG 
GCTTACCGGA 
ACGCTGTAGG 



CTGAAGCGGG 
CTCACCTTGC 
CGCTTGATCC 
GTACTCGGAT 
TCGCGCCAGC 
TCGTGACCCA 
GATTCATCGA 
CCCGTGATAT 
GTATCGCCGC 
GAGCGGGACT 
TTTCGATTCC 
CGGCTGGATG 
GTTTATTGCA 
AGCATTTTTT 
TGTCTGGATC 
CCTGTGTGAA 
TGTAAAGCCT 
CCCGCTTTCC 
GGGAGAGGCG 
TCGGTCGTTC 
ACAGAATCAG 
AACCGTAAAA 
CACAAAAATC 
GCGTTTCCCC 
TACCTGTCCG 
TATCTCAGTT 



6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
625
626 
627 
628 
629 
630 
631 
632 
633 
634 
635 
636 
637 
638 
639 
640 
641 
642 
643 
644 
645 
646 
647 
648 
649 
650 
651 
652 
653 
654 
655 
656 
657 
658 
659 
660 
661 
662 
663 
664 
665 
666 
667 
668 
669 
670 
671 
672 
673 
674 
675 
676 



CGGTGTAGGT 
GCTGCGCCTT 
CACTGGCAGC 
AGTTCTTGAA 
CTCTGCTGAA 
CCACCGCTGG 
GATCTCAAGA 
CACGTTAAGG 
ATTAAAAATG 
ACCAATGCTT 
TTGCCTGACT 
GTGCTGCAAT 
AGCCAGCCGG 
CTATTAATTG 
TTGTTGCCAT 
GCTCCGGTTC 
TTAGCTCCTT 
TGGTTATGGC 
TGACTGGTGA 
CTTGCCCGGC 
TCATTGGAAA 
GTTCGATGTA 
TTTCTGGGTG 
GGAAATGTTG 
ATTGTCTCAT 
CGCGCACATT 



CGTTCGCTCC 
ATCCGGTAAC 
AGCCACTGGT 
GTGGTGGCCT 
GCCAGTTACC 
TAGCGGTGGT 
AGATCCTTTG 
GATTTTGGTC 
AAGTTTTAAA 
AATCAGTGAG 
CCCCGTCGTG 
GATACCGCGA 
AAGGGCCGAG 
TTGCCGGGAA 
TGCTACAGGC 
CCAACGATCA 
CGGTCCTCCG 
AGCACTGCAT 
GTACTCAACC 
GTCAATACGG 
ACGTTCTTCG 
ACCCACTCGT 
AGCAAAAACA 
AATACTCATA 
GAGCGGATAC 
TCCCCGAAAA 



AAGCTGGGCT 
TATCGTCTTG 
AACAGGATTA 
AACTACGGCT 
TTCGGAAAAA 
TTTTTTGTTT 
ATCTTTTCTA 
ATGAGATTAT 
TCAATCTAAA 
GCACCTATCT 
TAGATAACTA 
GACCCACGCT 
CGCAGAAGTG 
GCTAGAGTAA 
ATCGTGGTGT 
AGGCGAGTTA 
ATCGTTGTCA 
AATTCTCTTA 
AAGTCATTCT 
GATAATACCG 
GGGCGAAAAC 
GCACCCAACT 
GGAAGGCAAA 
CTCTTCCTTT 
ATATTTGAAT 
GTGCCACCT 



GTGTGCACGA 
AGTCCAACCC 
GCAGAGCGAG 
ACACTAGAAG 
GAGTTGGTAG 
GCAAGCAGCA 
CGGGGTCTGA 
CAAAAAGGAT 
GTATATATGA 
CAGCGATCTG 
CGATACGGGA 
CACCGGCTCC 
GTCCTGCAAC 
GTAGTTCGCC 
CACGCTCGTC 
CATGATCCCC 
GAAGTAAGTT 
CTGTCATGCC 
GAGAATAGTG 
CGCCACATAG 
TCTCAAGGAT 
GATCTTCAGC 
ATGCCGCAAA 
TTCAATATTA 
GTATTTAGAA 



ACCCCCCGTT 
GGTAAGACAC 
GTATGTAGGC 
GACAGTATTT 
CTCTTGATCC 
GATTACGCGC 
CGCTCAGTGG 
CTTCACCTAG 
GTAAACTTGG 
TCTATTTCGT 
GGGCTTACCA 
AGATTTATCA 
TTTATCCGCC 
AGTTAATAGT 
GTTTGGTATG 
CATGTTGTGC 
GGCCGCAGTG 
ATCCGTAAGA 
TATGCGGCGA 
CAGAACTTTA 
CTTACCGCTG 
ATCTTTTACT 
AAAGGGAATA 
TTGAAGCATT 
AAATAAACAA 



CAGCCCGACC 
GACTTATCGC 
GGTGCTACAG 
GGTATCTGCG 
GGCAAACAAA 
AGAAAAAAAG 
AACGAAAACT 
ATCCTTTTAA 
TCTGACAGTT 
TCATCCATAG 
TCTGGCCCCA 
GCAATAAACC 
TCCATCCAGT 
TTGCGCAACG 
GCTTCATTCA 
AAAAAAGCGG 
TTATCACTCA 
TGCTTTTCTG 
CCGAGTTGCT 
AAAGTGCTCA 
TTGAGATCCA 
TTCACCAGCG 
AGGGCGACAC 
TATCAGGGTT 
ATAGGGGTTC 



7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8700
9000
677 (4) INFORMATION FOR SEQ. ID. NO.: 3:
678 

679 (i) SEQUENCE CHARACTERISTICS: 
680 

681 (A) LENGTH: 54 bases 

682 (B) TYPE: nucleic acid

683 (C) STRANDEDNESS: single

684 (D) TOPOLOGY: linear 
685 

686 (ii) MOLECULE TYPE: DNA (genomic) 
687 

688 (iii) HYPOTHETICAL: yes 
689 

690 (iv) ANTI-SENSE: no 
691 

692 (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 3:

693 

694/ 5 ' J ATC^AQA^GAT CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCTl>2 



695

696

697

698 (5) INFORMATION FOR SEQ. ID. NO.: 4:
699 

700 (i) SEQUENCE CHARACTERISTICS: 
701 

702 (A) LENGTH: 30 bases 

703 (B) TYPE: nucleic acid

704 (C) STRANDEDNESS: single 

705 (D) TOPOLOGY: linear 
706 

707 (ii) MOLECULE TYPE: DNA (genomic) 
708 

709 (iii) HYPOTHETICAL: yes 




710

711 (iv) ANTI-SENSE: yes
712

713 (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 4:

5' TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT 30

714
715
716
717
718

719 (6) INFORMATION FOR SEQ. ID. NO.: 5:

720 

721 (i) SEQUENCE CHARACTERISTICS: 

722 

723 (A) LENGTH: 384 bases 

724 (B) TYPE: nucleic acid

725 (C) STRANDEDNESS: single
726 (D) TOPOLOGY: linear
727 

728 (ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: yes

(iv) ANTI-SENSE: no

(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 5:



737 ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA
738
739 ATA ATG TCC AGA GGG CAA ATT GTT CTC TCC CAG TCT CCA GCA ATC CTG
740
741 GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT TGC AGG GCC AGC TCA AGT
742
743 AGT TAC ATC CAC TGG TTC CAG CAG AAG CCA GGA TCC TCC CCC AAA CCC
744
745 ATT TAT GCC ACA TCC AAC CTG GCT TCT GGA GTC CCT GTT CGC TTC AGT
746
747 AGT GGG TCT GGG ACT TCT TAC TCT CTC ACA ATC AGC AGA GTG GAG GCT
748
749 GAT GCT GCC ACT TAT TAC TGC CAG CAG TGG ACT AGT AAC CCA CCC ACG
750
751 GGA GGG GGG ACC AAG CTG GAA ATC AAA 384
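SEQ. ID. NO.: 4 is marked ANTI-SENSE: yes, so its reverse complement is the strand that can be compared directly with a sense sequence such as SEQ. ID. NO.: 5. The Python sketch below is only an illustration: the function names are hypothetical, the sequences are transcribed from the listing as printed, and the comparison with the final row of SEQ. ID. NO.: 5 is an observation about the listed bases, not a statement made in the listing itself.

```python
# Sequences transcribed from the listing; spaces are only codon separators.
SEQ4_PRIMER = "TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT"   # SEQ. ID. NO.: 4 (ANTI-SENSE: yes)
SEQ5_LAST_ROW = "GGA GGG GGG ACC AAG CTG GAA ATC AAA"     # final row of SEQ. ID. NO.: 5

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string, ignoring spaces."""
    bases = seq.replace(" ", "").upper()
    return "".join(COMPLEMENT[b] for b in reversed(bases))


def shared_overlap(sense_end: str, primer_rc: str) -> int:
    """Length of the longest suffix of sense_end that is also a prefix of primer_rc."""
    sense = sense_end.replace(" ", "").upper()
    for n in range(min(len(sense), len(primer_rc)), 0, -1):
        if sense.endswith(primer_rc[:n]):
            return n
    return 0


rc = reverse_complement(SEQ4_PRIMER)
print(rc)                                 # sense-strand reading of the primer
print(shared_overlap(SEQ5_LAST_ROW, rc))  # bases shared with the end of SEQ. ID. NO.: 5
```

With the sequences as transcribed, the first 15 bases of the reverse complement coincide with the last 15 bases of SEQ. ID. NO.: 5.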
















;ta^5j^. 

CGG204- 
5GC255 

DTC^57 



(7) INFORMATION FOR SEQ. ID. NO.: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no

(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 6:

GCG GCT CCC ACG CGT GTC CTG TCC CAG



(8) INFORMATION FOR SEQ. ID. NO.: 7:

(i) SEQUENCE CHARACTERISTICS:






(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: yes 



(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 7: 



5' GCy6/Ck TGT TGT GCT AGC TG ( (A/C) (A/G)GA GAC (G/Ap&T Ga{3^29 



(9) INFORMATION FOR SEQ. ID. NO.: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 bases 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 8: 



ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT GTC 51
CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG CCT GGG 102
GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT ACC AGT TAC 153
AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG GAA TGG ATT GGA 204
GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT CAG AAG TTC AAA GGC 255
AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAG CTC 306
AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC TAT TAC TGT GCA AGA TCG ACT 357
TAC TAC GGC GGT GAC TGG TAC TTC AAT GTC TGG GGC GCA GGG ACC ACG GTC 408
ACC GTC TCT GCA 420
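
The number at the end of each sequence row is a running base count, and the last one should equal the declared LENGTH (420 bases for SEQ. ID. NO.: 8). The Python sketch below shows one way such a check could be done. It is an illustration only: the function name is hypothetical, and the rows are transcribed from the listing with the trailing counts left out.

```python
# Rows of SEQ. ID. NO.: 8 as transcribed from the listing (counts omitted).
ROWS = [
    "ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT GTC",
    "CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG CCT GGG",
    "GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT ACC AGT TAC",
    "AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG GAA TGG ATT GGA",
    "GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT CAG AAG TTC AAA GGC",
    "AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAG CTC",
    "AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC TAT TAC TGT GCA AGA TCG ACT",
    "TAC TAC GGC GGT GAC TGG TAC TTC AAT GTC TGG GGC GCA GGG ACC ACG GTC",
    "ACC GTC TCT GCA",
]
DECLARED_LENGTH = 420  # from "(A) LENGTH: 420 bases"


def running_totals(rows):
    """Return the cumulative base count after each row, ignoring spaces."""
    totals = []
    count = 0
    for row in rows:
        count += len(row.replace(" ", ""))
        totals.append(count)
    return totals


totals = running_totals(ROWS)
print(totals)
print(totals[-1] == DECLARED_LENGTH)
```

A mismatch between the final total and the declared length would point to a transcription or OCR error in one of the rows.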



